Web Structure Mining using Learning Automata and Probabilistic Grammar
نویسندگان
چکیده
One of the data mining techniques on the web is web usage mining. The purpose of this mining, extracting useful information from the user interaction data that can be obtained when using the web. By extracting this information, we can determine the relationship between the web pages and then we can do operations such as clustering and ranking web pages. The purpose of this article is to determine the relationship between web pages, using probabilistic grammar and learning automata. In the proposed algorithm first probabilistic grammar generated and association rules are extracted from the user navigation paths stored in log files. Extracted association rules evaluated by learning automata and the relationship between web pages by learning automata is determined. The simulation results compared with Antweb, Bollen and E-DLA shows that the proposed method has a considerable efficiency.
منابع مشابه
Optimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining
The Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page is one type of web data, was regarded as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...
متن کاملUsing an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks
Social networks are streaming, diverse and include a wide range of edges so that continuously evolves over time and formed by the activities among users (such as tweets, emails, etc.), where each activity among its users, adds an edge to the network graph. Despite their popularities, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...
متن کاملDetermining the Similarity of Web Pages based on Learning Automata and Probabilistic Grammar
As the number of web pages increases, search for useful information by users on web sites will become more significant. By determining the similarity of web pages, search quality can be improved; hence, users can easily find their relevant information. In this paper, distributed learning automata and probabilistic grammar were used to propose a new hybrid algorithm in order to specify the simil...
متن کاملFocused Crawling using Asynchronous Cellular Learning Automata
Web crawling is used to collect the web pages which will be indexed by a search engine. The search engine uses these crawled and indexed pages to answer users’ queries. Since the volume of web pages is very high and it increases continuously, search engines can index a limited number of web pages. Therefore, in recent years, the focused crawler algorithms have been introduced which act selectiv...
متن کاملA Link Prediction Method Based on Learning Automata in Social Networks
Nowadays, online social networks are considered as one of the most important emerging phenomena of human societies. In these networks, prediction of link by relying on the knowledge existing of the interaction between network actors provides an estimation of the probability of creation of a new relationship in future. A wide range of applications can be found for link prediction such as electro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013